Scientific journal publishers and omitted citations in bibliometric databases: Any relationship?

نویسندگان

  • Fiorenzo Franceschini
  • Domenico A. Maisano
  • Luca Mastrogiacomo
چکیده

. Introduction and literature review Bibliometric databases, like any database, are not free from errors. Despite the improved accuracy over the past ten ears – probably due to the systematic employ of automatic tools for correcting errors in cited article lists by editors and atabase administrators (Adam, 2002) – the problem is far from being solved. This is proven by (i) several recent articles ocumenting the existence of errors of different nature – e.g., (Franceschini & Maisano, 2011a; Jacsó, 2012) – and (ii) the act that bibliometric database staff constantly encourage users to report any noticed inaccuracy. A synthetic classification of the major database errors is reported in Table 1, distinguishing between authors’ and database apping errors. The contributions by Buchanan (2006), Jacsó (2006) and Li, Burnham, Lemley, and Britton (2010) and Olensky 2013) show that one of the main consequences of these errors is represented by omitted citations, i.e., citations that should e ascribed to a certain (cited) paper but, for some reason, are lost. In other terms, the link between citing and cited article s not established by the database. Even though the scientific literature reports numerous notifications of blunders (sometimes grotesque!) by Google cholar (Labbé, 2010), it often ignores the citations omitted by the two major multidisciplinary databases: Scopus and eb of Science (WoS). According to the study by Buchanan (2006), citations omitted by WoS are likely to be around 5–10% ∗ Corresponding author. Tel.: +39 0907225. E-mail addresses: [email protected] (F. Franceschini), [email protected] (D. Maisano), [email protected] L. Mastrogiacomo). http://dx.doi.org/10.1016/j.joi.2014.07.003 751-1577/© 2014 Elsevier Ltd. All rights reserved. 752 F. Franceschini et al. / Journal of Informetrics 8 (2014) 751–765 Table 1 Classification of bibliometric database errors according to Buchanan (2006). Error type Author errors Database mapping errors Definition Errors made by authors when creating the list of cited articles for their publication Failure to establish an electronic link between a cited article and the corresponding citing articles that can be attributed to a data-entry error Examples Errors in name and initials of the first author Errors in publication title Errors in publication year Transcription errors Target-source article record errors Cited article omitted from a cited-article list Errors in volume number Errors in pagination. Reason unknown of the “true” number of citations (i.e., the number of citations that would be indexed in the ideal case of absence of omitted citations). Unfortunately, most of the contributions on database errors rely on the manual analysis of small samples of scientific articles and therefore results are not very robust statistically. To overcome this obstacle, Franceschini, Maisano, and Mastrogiacomo (2013) introduced an algorithm for estimating a database’s omitted-citation rate automatically. This algorithm requires the combined use of two or more bibliometric databases and is based upon the hypothesis that the mismatch between the citations occurring in one database and another one is evidence of possible errors/omissions. The automated algorithm has been recently applied by Franceschini, Maisano, and Mastrogiacomo (2014) to a large set of journals in the Manufacturing field, showing that – with a few exceptions – the differences in terms of omitted-citation rate (p) between these journals is included between 4% and 10% for WoS and between 2% and 8% for Scopus. The same study showed that omitted citations are usually “isolated accidents”, which concern a very small portion of the articles published by a certain journal; nevertheless, these errors may affect indicators based on citation statistics significantly. Going back to the classification in Table 1, it is not unreasonable to guess that some editorial styles imposed by certain publishers could hamper the correct identification or the citing/cited papers by a database. For example, citing papers containing lists of references with (i) abbreviated journal titles (ISO 4:1997, 1997; Thomson Reuters, 2014a), (ii) first authors’ names only, or (iii) omitted paper titles (not unusual for journals in the Physics field) could complicate the identification of cited papers. Apart from the list of references, other features concerning publishers could affect the database propensity to omit citations, such as: the type of data made available (PDF or HTML version of the articles) and the ability to detect and correct pre-existing errors in the cited article list of a manuscript, before publication. We remark that database errors often result from pre-existing author errors, which are unnoticed throughout the production/indexing process of a paper. In the best cases, reviewers, publishers or even database staff (in chronological sequence) are able to detect and correct them. In the worst cases, they may ignore these errors and even generate new ones. The objective of this paper is to investigate the possible relationship between omitted citations and publishers of the relevant citing papers (e.g., Elsevier, Springer, Taylor & Francis, etc.). The analysis relies on the same dataset used in (Franceschini et al., 2014) – which concern (cited) papers in the field of Manufacturing Engineering – and focuses the attention on the publishers of the citing articles; omitted citations are examined from the perspective of both Scopus and WoS. The remaining of the paper is organized into four sections. Section 2 recalls the automated algorithm for analysing omitted citations. Section 3 provides a detailed illustration of the methodology of data collection and analysis. Section 4 presents the analysis results. The concluding section summarizes the original contributions of this paper, focusing the attention on the relevant implications, limitations and ideas for future research. 2. Automated algorithm for analysing the omitted citations Before recalling the algorithm, we present an introductory example to illustrate how it works. Let us consider a fictitious paper of interest indexed by Scopus and WoS. The number of citations received by this paper is 13 in Scopus and 12 in WoS (see Table 2). The union of the citations recorded by the two databases is a total of nineteen citations. Among the citing articles, only nine belong to sources (i.e., journals or conference proceedings) officially covered by both databases (highlighted in grey in Table 2). Focusing on these nine “theoretically overlapping” (TO) citing articles, two are omitted by Scopus (but not by WoS) and one is omitted by WoS (but not by Scopus). Therefore, from the perspective of the paper of interest, a rough estimate of the omitted-citation rate is 2/9 ≈ 22.2% in Scopus and 1/9 ≈ 11.1% in WoS. The same reasoning can be extended to multiple papers of interest and more than two bibliometric databases. Let us now focus attention on the automated algorithm, which is based on the combined use of two bibliometric databases (Scopus and WoS in this case) and can be summarized in three steps: 1. Identify a set of (P) papers of interest, indexed by both the databases. F. Franceschini et al. / Journal of Informetrics 8 (2014) 751–765 753 Table 2 Citation statistics relating to a fictitious article, according to Scopus and WoS. The union of the citations recorded by the two databases (see the first column) is a total of nineteen citations. Among the citing articles, only nine belong to sources officially covered by both databases (highlighted in grey). Citing Article No. Citations in Scopus WoS 1 √ 2 √ 3 √ √ 4 √ √ 5 √ √ 6 Omitted √ 7 √ 8 √ √ 9 √ √ 10 √ 11 Omitted √ 12 √ √ 13 √ 14 √ Omitted 15 √ 16 √ 17 √ 18 √ 19 √

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Correction of "Old" Omitted Citations by Bibliometric Databases

Omitted citations – i.e., missing links between a cited paper and the corresponding citing papers – are the main consequence of several bibliometric-database errors. To reduce these errors, databases may undertake two actions: (i) improving the control of the (new) papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (ii) detecting and correcting errors in the papers a...

متن کامل

A novel approach for estimating the omitted-citation rate of bibliometric databases with an application to the field of bibliometrics

One of the most significant inaccuracies of bibliometric databases is that of omitted citations, namely, missing electronic links between a paper of interest and some citing papers, which are (or should be) covered by the database. This paper proposes a novel approach for estimating a database’s omitted-citation rate, based on the combined use of 2 or more bibliometric databases. A statistical ...

متن کامل

عملکرد مجلات علوم پزشکی ایران نمایه شده در پایگاه گزارش استنادی نشریات

Introduction: With an increasing number of Iranian scientific journals indexed by international databases, assessment of their performance and status is also needed. The present study aims to investigate the international performance of Iranian medical journals. Methods: To conduct the current study, bibliometric research method has been applied. International statuses of 21 Iranian ...

متن کامل

Coverage, field specialisation and the impact of scientific publishers indexed in the Book Citation Index

Purpose: The aim of this study is to analyze the disciplinary coverage of the Thomson Reuters' Book Citation Index database focusing on publisher presence, impact and specialization. Design/Methodology/approach: We conduct a descriptive study in which we examine coverage by discipline, publisher distribution by field and country of publication, and publisher impact. For this the Thomson Reuters...

متن کامل

Selection of Core Dental Journals using Citation Analysis of Scientific Journals in the Field of Dentistry

  Objective: Citation analysis is a bibliometric method used for the calculation of the mean number of citations and detection of highly cited references, frequency distribution of citations in different languages, update rate, efficacy of resources and number of core journals of a scientific domain. This process elucidates the information needs of researchers in a specific realm of science and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Informetrics

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2014